Generalized Distillation Framework for Speaker Normalization
نویسندگان
چکیده
Generalized distillation framework has been shown to be effective in speech enhancement in the past. We extend this idea to speaker normalization without any explicit adaptation data in this paper. In the generalized distillation framework, we assume the presence of some “privileged” information to guide the training process in addition to the training data. In the proposed approach, the privileged information is obtained from a “teacher” model, trained on speaker-normalized FMLLR features. The “student” model is trained on un-normalized filterbank features and uses teacher’s supervision for cross-entropy training. The proposed distillation method does not need first pass decode information during testing and imposes no constraints on the duration of the test data for computing speakerspecific transforms unlike in FMLLR or i-vector. Experiments done on Switchboard and AMI corpus show that the generalized distillation framework shows improvement over un-normalized features with or without i-vectors.
منابع مشابه
A Bayesian Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification
The purpose of this paper is to unify several of the state-of-the-art score normalization techniques applied to text-independent speaker verification systems. We propose a new Bayesian framework for this purpose. The two well-known Zand T-normalization techniques can be easily interpreted in this framework as different ways to estimate score distributions. This is useful as it helps to understa...
متن کاملSegmental Normalization for Robust Speaker Verification
For the task of speaker verification, similarity measure normalization methods are relevant to cope with variability problems and with data and/or decision fusion problems. The aim of this paper is to suggest a new method of normalization which combines classical world model based normalization techniques with ones based on a posteriori probability. This original method presents the well-known ...
متن کاملStructural framework for combining speaker recognition methods
The paper describes a structural framework for the design of a speaker recognition system based on multiple models. This combination is not only at the recognition level, but also at a joint training of the models. This unified training of the models uses a common structure : a decomposition tree of the set of data of normalization speakers. For the experiments, the Gaussian Mixture Model and t...
متن کاملGeneralized Modular Framework for Distillation Column Synthesis
In this work the distillation column sequencing problem is addressed through the Generalized Modular Framework, based on formal superstructure optimization techniques. The proposed method overcomes structural complexities through the use of systematically composed structural models incorporating all the feasible sequencing alternatives. The generated sequences are evaluated with respect to thei...
متن کاملExperiments on speaker profile portability
This paper addresses the problem of speaker characterization in the speaker-dependent speech recognition problem. Speaker Adaptation and Normalization techniques are designed to reduce the mismatch introduced by inter-speaker variability. Yet there is another source of mismatch introduced by intra-speaker variability. Indeed, the speaking style of a speaker depends on the nature of the speech u...
متن کامل